NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The Evolution of Automated Software Repair

https://doi.org/10.1109/TSE.2025.3533309

Le_Goues, Claire; Nguyen, ThanhVu; Forrest, Stephanie; Weimer, Westley (January 2025, IEEE Transactions on Software Engineering)

Full Text Available
BatFix: Repairing language model-based transpilation

https://doi.org/10.1145/3658668

Ramos, Daniel; Lynce, Inês; Manquinho, Vasco; Martins, Ruben; Le_Goues, Claire (July 2024, ACM Transactions on Software Engineering and Methodology)

To keep up with changes in requirements, frameworks, and coding practices, software organizations might need to migrate code from one language to another. Source-to-source migration, or transpilation, is often a complex, manual process. Transpilation requires expertise both in the source and target language, making it highly laborious and costly. Languages models for code generation and transpilation are becoming increasingly popular. However, despite capturing code-structure well, code generated by language models is often spurious and contains subtle problems. We proposeBatFix, a novel approach that augments language models for transpilation by leveraging program repair and synthesis to fix the code generated by these models.BatFixtakes as input both the original program, the target program generated by the machine translation model, and a set of test cases and outputs a repaired program that passes all test cases. Experimental results show that our approach is agnostic to language models and programming languages.BatFixcan locate bugs spawning multiple lines and synthesize patches for syntax and semantic bugs for programs migrated fromJavatoC++andPythontoC++from multiple language models, including, OpenAI’sCodex.
more » « less
Full Text Available
Syntax Is All You Need: A Universal-Language Approach to Mutant Generation

https://doi.org/10.1145/3643756

Deb, Sourav; Jain, Kush; van_Tonder, Rijnard; Le_Goues, Claire; Groce, Alex (July 2024, Proceedings of the ACM on Software Engineering)

While mutation testing has been a topic of academic interest for decades, it is only recently that “real-world” developers, including industry leaders such as Google and Meta, have adopted mutation testing. We propose a new approach to the development of mutation testing tools, and in particular the core challenge ofgenerating mutants. Current practice tends towards two limited approaches to mutation generation: mutants are either (1) generated at the bytecode/IR level, and thus neither human readable nor adaptable to source-level features of languages or projects, or (2) generated at the source level by language-specific tools that are hard to write and maintain, and in fact are often abandoned by both developers and users. We propose instead that source-level mutation generation is a special case ofprogram transformationin general, and that adopting this approach allows for a single tool that can effectively generate source-level mutants for essentiallyanyprogramming language. Furthermore, by usingparser parser combinatorsmany of the seeming limitations of an any-language approach can be overcome, without the need to parse specific languages. We compare this new approach to mutation to existing tools, and demonstrate the advantages of using parser parser combinators to improve on a regular-expression based approach to generation. Finally, we show that our approach can provide effective mutant generation even for a language for which it lacks any language-specific operators, and that is not very similar in syntax to any language it has been applied to previously.
more » « less
Full Text Available
Automated Program Repair, What Is It Good For? Not Absolutely Nothing!

https://doi.org/10.1145/3597503.3639095

Eladawy, Hadeel; Le_Goues, Claire; Brun, Yuriy (April 2024, ACM)

Industrial deployments of automated program repair (APR), e.g., at Facebook and Bloomberg, signal a new milestone for this exciting and potentially impactful technology. In these deployments, developers use APR-generated patch suggestions as part of a human-driven debugging process. Unfortunately, little is known about how using patch suggestions affects developers during debugging. This paper conducts a controlled user study with 40 developers with a median of 6 years of experience. The developers engage in debugging tasks on nine naturally-occurring defects in real-world, open-source, Java projects, using Recoder, SimFix, and TBar, three state-of-the-art APR tools. For each debugging task, the developers either have access to the project's tests, or, also, to code suggestions that make all the tests pass. These suggestions are either developer-written or APR-generated, which can be correct or deceptive. Deceptive suggestions, which are a common APR occurrence, make all the available tests pass but fail to generalize to the intended specification. Through a total of 160 debugging sessions, we find that access to a code suggestion significantly increases the odds of submitting a patch. Correct APR suggestions increase the odds of debugging success by 14,000%, but deceptive suggestions decrease the odds of success by 65%. Correct suggestions also speed up debugging. Surprisingly, we observe no significant difference in how novice and experienced developers are affected by APR, suggesting that APR may find uses across the experience spectrum. Overall, developers come away with a strong positive impression of APR, suggesting promise for APR-mediated, human-driven debugging, despite existing challenges in APR-generated repair quality.
more » « less
Full Text Available
Large Language Models for Test-Free Fault Localization

https://doi.org/10.1145/3597503.3623342

Yang, Aidan_Z H; Le_Goues, Claire; Martins, Ruben; Hellendoorn, Vincent (February 2024, ACM)

Full Text Available
Contextual Predictive Mutation Testing

https://doi.org/10.1145/3611643.3616289

Jain, Kush; Alon, Uri; Groce, Alex; Le_Goues, Claire (November 2023, ACM)

Full Text Available
Mind the Gap: The Difference Between Coverage and Mutation Score Can Guide Testing Efforts

https://doi.org/10.1109/ISSRE59848.2023.00036

Jain, Kush; Kalburgi, Goutamkumar Tulajappa; Le_Goues, Claire; Groce, Alex (October 2023, IEEE)

Full Text Available
CAT-LM Training Language Models on Aligned Code And Tests

https://doi.org/10.1109/ASE56229.2023.00193

Rao, Nikitha; Jain, Kush; Alon, Uri; Le_Goues, Claire; Hellendoorn, Vincent J (September 2023, IEEE)

Full Text Available

Search for: All records